Elastic-Degenerate String Matching via Fast Matrix Multiplication

Authors

Abstract

An elastic-degenerate (ED) string is a sequence of $n$ sets of strings of total length $N$, which was recently proposed to model a set of similar sequences. The ED string matching (EDSM) problem is to find all occurrences of a pattern of length $m$ in an ED text. The EDSM problem has recently received some attention in the combinatorial pattern matching community, and an $\mathcal{O}(nm^{1.5}\sqrt{\log m} + N)$-time algorithm is known [Aoyama et al., CPM 2018]. The standard assumption in the prior work on this question is that $N$ is substantially larger than both $n$ and $m$, and thus we would like to have a linear dependency on the former. Under this assumption, the natural open problem is whether we can decrease the 1.5 exponent in the time complexity, similarly as in the related (but, to the best of our knowledge, not equivalent) word break problem [Backurs and Indyk, FOCS 2016]. Our starting point is a conditional lower bound for the EDSM problem. We use the popular combinatorial Boolean matrix multiplication (BMM) conjecture stating that there is no truly subcubic combinatorial algorithm for BMM [Abboud and Williams, FOCS 2014]. By designing an appropriate reduction, we show that a combinatorial algorithm solving the EDSM problem in $\mathcal{O}(nm^{1.5-\epsilon} + N)$ time, for any $\epsilon>0$, refutes this conjecture. Our reduction should be understood as an indication that decreasing the exponent requires fast matrix multiplication. String periodicity and fast Fourier transform are two standard tools in string algorithms. Our main technical contribution is that we successfully combine these tools with fast matrix multiplication to design a noncombinatorial $\tilde{\mathcal{O}}(nm^{\omega-1}+N)$-time algorithm for EDSM, where $\omega$ denotes the matrix multiplication exponent and the $\tilde{\mathcal{O}}(\cdot)$ notation suppresses polylog factors. To the best of our knowledge, we are the first to combine these tools. In particular, using the fact that $\omega<2.373$ [Alman and Williams, SODA 2021; Le Gall, ISSAC 2014; Williams, STOC 2012], we obtain an $\mathcal{O}(nm^{1.373} + N)$-time algorithm for EDSM. An important building block in our solution, which might find applications in other string problems, is a method of selecting a small set of length-$\ell$ substrings of the pattern, called anchors, so that any occurrence of a string from an ED text set contains at least one but not too many (on average) such anchors inside.
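To make the problem statement concrete, the following is a minimal brute-force sketch of EDSM in Python. It is not the algorithm from the paper and uses neither fast matrix multiplication, the fast Fourier transform, nor anchors. The function name `edsm_brute_force`, the representation of an ED text as a list of sets of strings, and the convention of reporting the indices of the sets in which an occurrence ends are all assumptions made for illustration.

```python
from typing import List, Set


def edsm_brute_force(ed_text: List[Set[str]], pattern: str) -> List[int]:
    """Return the indices i such that some occurrence of `pattern` ends
    inside a string of ed_text[i]. Illustrative brute force only."""
    m = len(pattern)
    if m == 0:
        raise ValueError("pattern must be non-empty")

    # Lengths p (0 < p < m) such that pattern[:p] matches a path through
    # the text that ends exactly at the end of the previous set.
    active: Set[int] = set()
    result: List[int] = []

    for i, segment in enumerate(ed_text):
        new_active: Set[int] = set()
        found = False
        for s in segment:
            # Case 1: the whole pattern lies inside a single string.
            if pattern in s:
                found = True
            # Case 2: an occurrence starts inside s, so a proper prefix
            # of the pattern is a suffix of s.
            for p in range(1, min(m - 1, len(s)) + 1):
                if s.endswith(pattern[:p]):
                    new_active.add(p)
            # Case 3: extend a partial match carried over from earlier sets.
            for p in active:
                rest = pattern[p:]
                if len(s) >= len(rest):
                    if s.startswith(rest):
                        found = True  # the occurrence ends inside s
                elif rest.startswith(s):
                    new_active.add(p + len(s))  # s is consumed entirely
        if found:
            result.append(i)
        active = new_active
    return result


# Hypothetical ED text with n = 4 sets; the empty string models a deletion.
text = [{"AG", "A"}, {"", "C"}, {"TA", "T"}, {"CA"}]
print(edsm_brute_force(text, "ACT"))  # -> [2]
```

In this example the pattern `ACT` is spelled by taking `A` from the first set, `C` from the second, and a prefix of `TA` (or all of `T`) from the third. The sketch keeps up to $m$ partial-match lengths per set and compares substrings naively, which is why its running time is far from the bounds discussed in the abstract.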


Similar Resources

Deterministic and Probabilistic Algorithms for Maximum Bipartite Matching Via Fast Matrix Multiplication

Let $G = (S, T, E)$ be a bipartite graph, where $S \cup T$ is the set of nodes ($S \cap T = \emptyset$) and $E$ is the set of edges, $E \subseteq S \times T$. Let $S = \{u_1, \ldots, u_s\}$, $T = \{v_1, \ldots, v_t\}$ ($s \le t$), and $|E| = e$. An $(S, T)$ matching is a subset $M$ of $E$ such that no two edges in $M$ have a common endpoint. A maximum matching is a matching of maximum cardinality. The set of nodes which take part in such a maximum matchin...


Fast Matrix Multiplication

We give an overview of the history of fast algorithms for matrix multiplication. Along the way, we look at some other fundamental problems in algebraic complexity like polynomial evaluation. This exposition is self-contained. To make it accessible to a broad audience, we only assume a minimal mathematical background: basic linear algebra, familiarity with polynomials in several variables over r...


Efficient Pattern Matching in Elastic-Degenerate Strings

In this paper, we extend the notion of gapped strings to elastic-degenerate strings. An elastic-degenerate string can be seen as an ordered collection of $k > 1$ seeds (substrings/subpatterns) interleaved by elastic-degenerate symbols such that each elastic-degenerate symbol corresponds to a set of two or more variable-length strings. Here, we present an algorithm for solving the pattern matchi...


Generating Random Spanning Trees via Fast Matrix Multiplication

We consider the problem of sampling a uniformly random spanning tree of a graph. This is a classic algorithmic problem for which several exact and approximate algorithms are known. Random spanning trees have several connections to Laplacian matrices; this leads to algorithms based on fast matrix multiplication. The best algorithm for dense graphs can produce a uniformly random spanning tree of ...



Journal

Journal title: SIAM Journal on Computing

Year: 2022

ISSN: 1095-7111, 0097-5397

DOI: https://doi.org/10.1137/20m1368033